Unified Named Entity Recognition as Word-Word Relation Classification
Abstract
So far, named entity recognition (NER) has been involved with three major types, including flat, overlapped (a.k.a. nested), and discontinuous NER, which have mostly been studied individually. Recently, a growing interest has been built for unified NER, tackling the above three jobs concurrently with one single model. Current best-performing methods mainly include span-based and sequence-to-sequence models, where unfortunately the former merely focus on boundary identification and the latter may suffer from exposure bias. In this work, we present a novel alternative by modeling unified NER as word-word relation classification, namely W^2NER. The architecture resolves the kernel bottleneck of unified NER by effectively modeling the neighboring relations between entity words with Next-Neighboring-Word (NNW) and Tail-Head-Word-* (THW-*) relations. Based on the W^2NER scheme, we develop a neural framework in which unified NER is modeled as a 2D grid of word pairs. We then propose multi-granularity 2D convolutions for better refining the grid representations. Finally, a co-predictor is used to sufficiently reason about the word-word relations. We perform extensive experiments on 14 widely-used benchmark datasets for flat, overlapped, and discontinuous NER (8 English and 6 Chinese datasets), where our model beats all current top-performing baselines, pushing the state-of-the-art performances of unified NER.
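The word-word relation scheme described in the abstract can be illustrated with a small sketch. It assumes, as a reading of the abstract rather than a statement of the authors' implementation, that an NNW relation links each entity word to the next word of the same entity, and that a THW-* relation links an entity's tail word back to its head word while carrying the entity type. The function names `encode` and `decode`, the example sentence, and the `Symptom` label are hypothetical and chosen for illustration only.

```python
from collections import defaultdict

def encode(entities):
    """Turn entities into word-pair relations (assumed layout, see lead-in).

    entities: iterable of (word_indices, entity_type), where word_indices is
    an increasing tuple of token positions (gaps allowed for discontinuous
    mentions).
    Returns (nnw, thw): nnw is a set of (word, next_word) pairs, thw is a set
    of (tail, head, entity_type) triples.
    """
    nnw, thw = set(), set()
    for indices, etype in entities:
        for a, b in zip(indices, indices[1:]):
            nnw.add((a, b))                       # Next-Neighboring-Word link
        thw.add((indices[-1], indices[0], etype))  # Tail-Head-Word-<type> link
    return nnw, thw

def decode(nnw, thw):
    """Recover entities by following NNW links from each head to its tail."""
    successors = defaultdict(list)
    for a, b in nnw:
        successors[a].append(b)
    entities = set()
    for tail, head, etype in thw:
        stack = [(head,)]
        while stack:
            path = stack.pop()
            last = path[-1]
            if last == tail:
                entities.add((path, etype))
                continue
            for nxt in successors[last]:
                if nxt <= tail:                   # never walk past the tail word
                    stack.append(path + (nxt,))
    return entities

# Hypothetical sentence with two discontinuous mentions sharing "aching in".
words = ["I", "am", "having", "aching", "in", "legs", "and", "shoulders"]
gold = [((3, 4, 5), "Symptom"), ((3, 4, 7), "Symptom")]
nnw, thw = encode(gold)
for indices, etype in sorted(decode(nnw, thw)):
    print(etype, " ".join(words[i] for i in indices))
```

Storing the relations as sets of index pairs is equivalent to filling cells of a 2D grid of word pairs; a dense matrix would be the natural form once a neural model predicts a label for every cell.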
Similar resources
Effective Word Representation for Named Entity Recognition
Recently, various machine learning models have been built using word-level embeddings and have achieved substantial improvement in NER prediction accuracy. Most NER models only take words as input and ignore character-level information. In this paper, we propose an effective word representation that efficiently includes both the word-level and character-level information by averaging its charac...
Transliterated Named Entity Recognition Based on Chinese Word Sketch
One of the unique challenges to Chinese Language Processing is cross-strait named entity recognition. Due to the adoption of different transliteration strategies, foreign name transliterations can vary greatly between PRC and Taiwan. This situation poses a serious problem for NLP tasks, including data mining, translation and information retrieval. In this paper, we introduce a novel approach to...
Deep learning with word embeddings improves biomedical named entity recognition
Motivation: Text mining has become an important tool for biomedical research. The most fundamental text-mining task is the recognition of biomedical named entities (NER), such as genes, chemicals and diseases. Current NER methods rely on pre-defined features which try to capture the specific surface properties of entity types, properties of the typical local context, background knowledge, and li...
Semi-supervised Bio-named Entity Recognition with Word-Codebook Learning
We describe a novel semi-supervised method called Word-Codebook Learning (WCL), and apply it to the task of bio-named entity recognition (bioNER). Typical bioNER systems can be seen as tasks of assigning labels to words in bio-literature text. To improve supervised tagging, WCL learns a class of word-level feature embeddings to capture word semantic meanings or word label patterns from a large unl...
Chinese Word Segmentation and Named Entity Recognition by Character Tagging
This paper describes our word segmentation system and named entity recognition (NER) system for participating in the third SIGHAN Bakeoff. Both of them are based on character tagging, but use different tag sets and different features. Evaluation results show that our word segmentation system achieved 93.3% and 94.7% F-score in UPUC and MSRA open tests, and our NER system got 70.84% and 81.32% F...
Journal
Journal title: Proceedings of the ... AAAI Conference on Artificial Intelligence
Year: 2022
ISSN: 2159-5399, 2374-3468
DOI: https://doi.org/10.1609/aaai.v36i10.21344